Challenges in Computational Statistics and Data Mining
Similar resources
Presented a method for estimating the cost of software using PCA to reduce the size and with the help of data mining
These days, data mining is one of the most significant issues. As a field, data mining is a mixture of computer science and statistics, which is considerably limited due to the increase in digital data and the growth of the computational power of computers. One of the domains of data mining is the software cost estimation category. In this article, classifying techniques of learning algorithm of machine ...
Computational Challenges in Data Mining
Data mining is applied in business to find new market opportunities from data stored in operational databases which are used for day-to-day management. The tools applied combine ideas from statistics, machine learning, database technology and high-performance computing to find nuggets of knowledge. Data mining is also applied in science, for example to find taxonomies of variable stars and in t...
Scalable Model-based Clustering Algorithms for Large Databases and Their Applications
With the unabated growth of data amassed from business, scientific and engineering disciplines, cluster analysis and other data mining functionalities play an increasingly important role. They can reveal previously unknown and potentially useful patterns and relations in large databases. One of the most significant challenges in data mining is scalability: effectively handling large databases...
Accurate Recasting of Parameter Estimation Algorithms Using Sufficient Statistics for Efficient Parallel Speed-Up: Demonstrated for Center-Based Data Clustering Algorithms
Fueled by advances in computer technology and online business, data collection is rapidly accelerating, and so is the importance of its analysis (data mining). Increasing database sizes strain the scalability of many data mining algorithms. Data clustering is one of the fundamental techniques in data mining solutions. The many clustering algorithms developed face new challenges with growing data...
Efficient Data Mining for Enabling Genome-wide Computing
The volume and diversity of the data acquired in biomedical applications offer unique challenges to data mining researchers. The solutions that effectively turn data into information and knowledge will advance not only the field of data mining but also the understanding of the underlying science. This paper will present new computational challenges faced by designing methods for enabling genome...